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Tij" ' Abstract 



We describe the structure of the graphs with the smallest average 



Q I distance and the largest average clustering given their order and size. 

There is usually a unique graph with the largest average clustering, 
which at the same time has the smallest possible average distance. In 

K/{ \ contrast, there are many graphs with the same minimum average dis- 

^ • tance, ignoring their average clustering. The form of these graphs is 



shown with analytical arguments. Finally, we measure the sensitivity 
to rewiring of this architecture with respect to the clustering coeffi- 
cient, and we devise a method to make these networks more robust 
with respect to vertex removal. 

1 Introduction 

Complex networks, as abstract models of large dynamical systems, match 
the structure of real-world networks in many diverse areas. These include 



both natural and man-made systems such as gene regulation, protein inter- 
action networks, food webs, economic and social networks and the internet, 
to name a few (see [3] and references therein). One of the prominent fea- 
tures that distinguishes them from random networks is the clustering among 
their individual units, quantified by their large clustering coefficient, and at 
the same time maintaining a small average path length among them. These 
properties were thought to be mutually exclusive, but in almost all real net- 
works, having a small average distance does not greatly interfere with the 
presence of a large clustering coefficient, as shown in [i]. Despite the consid- 
erable amount of literature in the area of complex networks, there has been 
no study exploring the properties of a graph required in order for it to have 
both large average clustering and small average distance. In this article, we 
will find the architecture of the networks with the largest average clustering, 
and show that at the same time their average distance is smaller or equal to 
the average distance of any other graph. We will also study their resilience 
to vertex and edge removal, and how sensitive their properties are to edge 
rewiring. 

2 Preliminaries 

A graph is an ordered pair Q = (V, S) comprising of a set V of vertices to- 
gether with a set £ of edges which are unordered 2-element subsets of V. 
Two vertices that are connected through an edge are called neighbors. All 
graphs in this study are simple, meaning that all edges connect two differ- 
ent vertices, and there is no more than one edge between any two different 
vertices. The order of a graph is the number of its vertices, |V|. A graph's 
size is \S\ , the number of edges. When there is no danger of confusion, we 
denote the graph with order A^ and size m as Q{N,m). A complete graph 
is a graph in which each vertex is connected to every other (with one edge 
between each pair of vertices). The edge density of the graph is defined as 
D = m/ ( 2 ) , representing the number of present edges, as a fraction of the 
number of edges of a complete graph. A clique in a graph is a subset of its 
vertices such that every two vertices in the subset are connected by an edge. 
A clique that consists of three vertices (and three edges) is called a triangle. 
The degree of a vertex is the number of edges that connect to it. A path 
is a sequence of consecutive edges in a graph and the length of the path is 
the number of edges traversed. A path with no repeated vertices is called a 



simple path. A graph is called connected if for every pair of vertices u and v, 
there is a path from u to v. Otherwise the graph is called disconnected. The 
distance d{u, v) for a pair of vertices u and v in an undirected simple graph Q 
is the shortest path from u to v (and vice versa). A cycle is a closed (simple) 
path, with no other repeated vertices or edges other than the starting and 
ending vertices. A cycle is called chordless when there is no edge joining two 
nodes that are not adjacent in the cycle. A cut vertex of a connected graph 
is a vertex that if removed, (along with all edges incident with it) produces 
a graph that is disconnected. A tree is a graph in which any two vertices are 
connected by exactly one simple path. Finally, a subgraph "H of a graph Q is 
said to be induced if, for any pair of vertices u and v of "H, {u, v) is an edge 
of "H if and only if {u, v) is an edge of Q. In other words, "H is an induced 
subgraph of Q if it has all the edges that appear in Q over the same vertex 
set. If the vertex set of 1-i is the subset S of V(^), then l-L can be written as 
Q[S] and is said to be induced by S. 

We will solve the problem of finding the graph which has the smallest 
average distance, and the largest average clustering coefficient, for given order 
A^ and size m. 



3 Average Clustering and Average Distance 

The local clustering coefficient of a vertex u is defined as the number of con- 
nections between vertices that are neighbors of m, divided by the total number 
of pairs of neighbors of m [ ]. It is the number of triangles in which u partic- 
ipates divided by the number of all possible triangles it could participate in, 
if all its neighbors were connected to each other. 

More formally, if du is the degree of a vertex m, and tu is the number of 
edges among its neighbors, its clustering coefficient is 

if ci„ = 
C{u) = <j 1 if c?„ = 1 (1) 



(t 



if d, > 2. 



An example is shown in Figure 1. The clustering coefficient of a vertex can 
only take values in the interval [0,1]. Note that we deliberately choose to 
define the clustering coefficient of a vertex u with degree (i^ = 1 as equal 
to 1. The graphs with the largest clustering under this convention may be 



different when we assume that vertices with degree 1 have zero clustering. 
The method to find the graphs with the largest clustering under the latter 
assumption is similar, and will be described later. 

The average clustering coefficient for a graph Q is simply the average of all 
the local clustering coefficients in its vertex set V{Q). A large average clus- 
tering coefficient is a proxy for increased robustness of the network, "local" 
structure and increased connection density among vertices in a neighbor- 
hood [2]. If A^ is the order of the network, the average clustering coefficient 
is defined as 

^(^) = 1^ E ^H- (2) 

uev{g) 

Since we will only be comparing graphs with the same number of vertices 
and edges, in order to make our analysis easier, we will be considering the 
sum of the clustering coefficients of all the vertices: 

Cs{Q) = Yl c*!^)- (3) 

u&v{g) 

Maximizing Cs{Q) is equivalent to maximizing C{Q). If a network has both 
a high average clustering coefficient, and a small average distance, it is called 
a "small world" network [ ]. This architecture is conjectured to have other 
desired properties, like enhanced signal propagation speed, synchronizability 
and computational power [1],[3]. As it turns out, the networks with the 
largest average clustering are "small world" networks, since they also have 
the smallest possible average distance. 



4 Networks with the Smallest Average Dis- 
tance 

The average distance of a network is an important property, since it is di- 
rectly correlated with how different parts of the network communicate, and 
exchange information. A small average distance is a proxy for improved 
synchronizability, efficient computation and signal propagation across the 
network [3]. In this section, we will analytically compute the minimum aver- 
age distance of a graph of given order and size, and find a sufficient condition 
in order to achieve that minimum. 






(a) 



(b) 



(c) 



Figure 1: (a) The clustering coefficient of a vertex is the number of connections 
between its neighbors, divided by all pairs of neighbors, whether they are connected 
by an edge or not. The clustering coefficient of the blue vertex u is C{u) = 4/(2) = 
0.4. The average shortest path length is Lg = 1.48. (b) After rewiring one edge, 
and decreasing the distance between the red and the green vertex, the network has 
the smallest possible average distance, equal to 1.43. All vertex pairs now have at 
most a distance of 2. (c) The network with the largest possible average clustering 
and smallest average shortest path length. The blue vertex is the central vertex 
of the induced star subgraph. 



Lemma 1. Assume we have a graph Q{N,m) with average distance Cg, and 
we introduce one additional edge. The graph Q'{N,m + 1) has an average 
distance Cq' < -^g- 

Proof. Assume that the edge connects two previously unconnected vertices s 
and t, changing their distance to d'{s,t) = 1. Since they were not connected 
before, their distance was d{s,t) > 2 > d'{s,t). For every other pair of 
vertices u and v, d'{u,v) < d{u,v), and hence, 



vsJ {u,v)GV^{g) V2J {u,v)ev^{g) 



d{u,v) = Cg- 



(4) 



D 



Lemma 2. The star graph is the only tree of order N with the smallest 
average distance equal to Cstar 



-^ AT- 



Proof. A tree has exactly A^ — 1 edges among its A^ vertices. As a result, 
there will be exactly A^ — 1 pairs of vertices with distance d = 1, and ( 2" ) 



vertex pairs {u,v) that are not connected, with distances d{u,v) > 2. The 
star graph achieves this lower bound, and has the minimum possible average 
distance. It is also unique. Assume that the tree is not a star, and as a 
result, there is no vertex that is connected to all the remaining vertices. In 
this case, there are at least two vertices with distance d > 3, since in every 
tree there is a unique path connecting each pair. D 

Using the lemma above, we will find the smallest possible average distance 
of a graph with A^ vertices and m edges, which we denote Cj^^rn- 

Theorem 1. The minimum possible average distance of a graph Q{N,m) is 

equal to CN,m = 2 — -7^- ■ 
[2) 

Proof. The graph Q{N,m) has m pairs of vertices with distance exactly 1, 
and the rest of the pairs of vertices {u,v) have distances d{u,v) > 2. As a 
result, the minimum average distance is 



m + 2 ((^)-m) _^ m 



C, > - \\\' ■'■' =2-^. (5) 



This lower bound can always be achieved. A connected graph Q{N,m) with 
a vertex u of degree du = N — 1 has an induced star graph, and as a result all 
non- neighboring vertices will have distance equal to 2. All connected vertices 
have distance equal to 1, leading to the lower bound of equation (5). D 

Corollary 1. // a graph Q has two vertices u and v with distance d{u, v) > 3, 
then its average distance is Cg > CN,m- 

Proof. The number of pairs with distance 1 is fixed, equal to the graph's 
size. All other vertices have a distance of at least 2, and since the mini- 
mum is achieved when all non-neighboring pairs have a distance equal to the 
minimum, the average distance of Q is Cg > Ciy^m- D 

It is worth noting that a graph Q may have average distance Cg = CN,m 
without having a star graph as an induced subgraph. This is possible only 
if every pair of non-neighboring vertices has distance d = 2. An example is 
shown in Figures 1(a) and 1(b). The blue and green vertices had distance 
equal to 3 at first, and after rewiring one edge to connect them, the average 
distance became equal to the smallest possible. It is easy to see that networks 



with the smallest distance need to have a star-like architecture. Every pair 
of nodes needs to be part of at least one induced subgraph that is a star. 

The above observations also propose an easy way to quickly decrease 
the average distance of a network by rewiring a small number of edges. At 
each step, we rewire an edge that is part of the most triangles, to connect 
the two vertices in the network with the largest distance. The previously 
connected vertices have distance 2, which is the smallest possible distance 
for unconnected vertices, and the newly connected vertices have the largest 
possible decrease in their distance. Note that this method contributes to a 
large decrease in the average clustering of the network at the same time. 

5 Recursive Computation of Graph Cluster- 
ing 

Assume that we add one vertex u with degree d to a graph Q{N,m), by 
connecting it to the vertices in a set V {\V\ = d) and the result is a new 
graph Q'{N + l,m + d). The difference of the sum of clustering coefficients 
of the two graphs will be 

ACsiG', Q) = Cs{g') - Cs{g) = C{u) + ACsiG', G, V) (6) 

where C{u) is the clustering coefficient of the new vertex, and 

ACs{Q\G,V) = Y,{C\v)-C{v)) (7) 

vev 

is the sum of the differences of the clustering coefficients of the vertices in P, 
before and after they acquire their new edge. All other vertices have their 
clustering coefficient unchanged. 

Lemma 3. Assume that we have a graph Q{N,m) with sum of clustering 
coefficients Cs{Q), and we add one more vertex u, with degree d. The clus- 
tering difference of the two graphs ACs{Q',Q) = C{u) + ACs{Q',Q,T>) can 
only be the largest possible if the vertices in T> are part of a clique Q. 

Proof. If all vertices in T> are part of a clique, they form an induced com- 
plete subgraph. The clustering coefficient of u will be the maximum possible 
{C{u) = 1), since all the possible connections among its neighbors will be 



present. Also, each of the d vertices of Q (with \Q\ = q > d) that u is 
connected to, will increase the number of connections among their neighbors 
to the maximum extent (given the degree of u), and they will have d — 1 
additional triangles each. The clustering coefficient of the rest of the q — d 
vertices of the clique will not be affected. If, on the other hand, u forms 
connections with vertices that do not form a clique, its clustering coefficient 
will be C{u) < 1, and the vertices it is connected to will have less extra 
connections among their neighbors. D 

Note that when d < q, the new clustering coefficient of the vertices that 
will be connected to u will be less than 1, possibly smaller than it was before 
connecting the new vertex. In order to minimize the effect of the missing 
triangles on the overall clustering coefficient, we need to make sure that we 
connect u to those vertices of the clique with the largest degree (which may 
have additional connections outside the clique). Also, the size of the clique 
is an important factor. The larger the clique, the smaller the impact of the 
additional vertex, since the degrees of the connected vertices are larger, but 
on the other hand we may have more missing triangles in total. 

Lemma 4. Assume that we have a graph Q that consists of a clique Q and 
two additional vertices A and B that each connects to a subset T>i and T>2, 
with di = \Vi\ < \V2\ = d2-The average clustering of Q is maximized when 
Di C V2. 

Proof. The number of missing triangles stays the same regardless of the spe- 
cific vertices of the clique that A and B are connected to. So, the goal is to 
redistribute the missing triangles to vertices of the largest possible degree. 
When Vi C V2, the number of vertices of Q with clustering equal to 1 is 
maximized, and at the same time, the vertices of Q with the largest degree 
have the maximum number of missing triangles. D 

The above lemma can be used recursively for any number of edges con- 
necting to a clique. An example is shown in figure 2. 

Assume we have a graph Q which belongs to the set SN,m of all graphs 
with N vertices and m edges. We denote the clustering sum of the optimal 





(a) 



(b) 



Figure 2: An example of a graph with two vertices connected to a chque. The 
graph in (a) has a smaller sum of clustering coefficients Cs than the one in (b). 
By rewiring one edge in the graph on the left, we shift the missing triangles of one 
orange vertex to a blue one, with a higher degree. So the missing edges between a 
green and a white vertex have a smaller effect on the overall clustering. 

graph with Cs{N,m). Its clustering sum will be 



Cs{N,m) 



max 

5G5jv,m 



E^i' 



uev{g) 



(8) 



max 

5G5jv,m 



Y, C{u)]+C{v) 



.u£V{g)\v 



(9) 



for any vertex v G V{Q)- The reasoning behind the last equation is that if 
we pick any vertex from the graph, it should be "optimally" connected to 
a smaller graph that is itself optimal. This means that we need to make 
sure that the chosen vertex has the largest possible clustering coefficient 
with regard to the vertices it is connected to, the rest of the graph needs 
to have the largest clustering coefficient, and the potential decrease of the 
clustering coefficient of the vertices after connecting the last one should be 
minimal. This method provides an easy way to find the maximum clustering 
of a graph Q of order A^, by using a graph Q, of order A^ — 1, and connecting 



one additional vertex to it. The above can be algebraically expressed by 
conditioning on the set T> of vertices with which the additional new vertex v 
of degree d = \'D\ forms connections with. 

Cs{N,m)= max {CsiN - l,m - d) + Civ) + ACsiN - l,m - d,V)} 
VQV{go) 

(10) 

where ACs{N — l,m — d,V) is the change in the clustering coefficient of the 

d vertices of the set V of the graph of order A^ — 1 and size m — d when we 

connect one vertex (vertex v) with degree d to them. Again, the last equation 

argues that the graph with the largest possible clustering coefficient can be 

found by connecting a vertex to an optimal graph with fewer vertices and 

making sure that the algebraic value of the change is as large as possible. 



6 Almost Complete Graphs 

We call a connected graph of order A^ and size m almost complete when its 
largest clique has order A^ — 1 or A^ — 2. We distinguish these two cases 
by calling them type I and type II respectively. In order to be almost 
complete, a graph needs to have ( ~^) +1 <m< {2) ~ ^ (type I), and 
( ~^) + 2 < m < ( 2^^) (type II) edges. The vertices of the largest clique 
are called central vertices, whereas the vertices not belonging to it are called 
peripheral vertices. The two types of almost complete graphs are shown in 
Figure 3. 



Lemma 5. The total clustering for a type I almost complete graph Q{N,m) 

is Cs{Q) = N — ° /j7_iT , where a is the degree of the peripheral vertex. 
\ 2 ) 

Proof. Assume that we have a type I almost complete graph as described 
above. The peripheral vertex will have a clustering coefficient equal to 1, 
the a vertices with degree A^ — 1 will have a clustering coefficient C{u) = 

^ ^/iv-i\ — ; and the A^— 1 — a vertices with degree A^— 2 will have a clustering 

[ 2 ) 
coefficient equal to 1 (see figure 4(a)). The sum of the clustering coefficients 
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(a) Type I 



(b) Type II 



Figure 3: (a) The type I almost complete graph consists of a clique of A^ — 1 
vertices, and one peripheral vertex (shown in red) that connects to them, (b) The 
type II almost complete graph of order A^ consists of a clique of order N — 2, and 
two additional vertices that connect to it (and possibly to each other). 



for this graph will be 

CsiQ) = l + a 

= N + 
= N + 

= N- 



iV) + («-!) 



+ (A^-l-a) 



-a(V)+«(V)+«(«-l^ 





-1) 


a{N-l-a) 





iV) 



(11) 

(12) 

(13) 

(14) 

D 



The average clustering, as a function of a, is convex and symmetric 
around ^^y^- It decreases as a goes from 1 to ["^^^J, and then it increases 



as a goes from [— j— ] to A^ — 1. 



Lemma 6. A type I almost complete graph has larger clustering than any 
other nonisomorphic graph of the same order and size. 
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(b) 



(c) 



Figure 4: Clustering for the almost complete graphs, (a) The peripheral vertex 
(red) has unit clustering since it is connected to vertices of a clique. The vertices 
that are not connected to it (green) also have clustering equal to 1, as members of 
the complete graph with no other edges. The vertices that are connected to the 
peripheral vertex (blue) have clustering less than 1, since the peripheral vertex is 
not connected to all of their neighbors, (b) A type II almost complete graph with 
the largest clustering, where the two peripheral vertices are not connected (type 
Ila). (c) A maximum clustering coefficient type II almost complete graph, with 
connected peripheral vertices (type lib). 

Proof. Assume a type I almost complete graph J-" that consists of a chque P 
oi N — 1 vertices, and one peripheral vertex of degree a that is connected to 
it. ^ is a graph of the same order and size, whose largest clique Q consists 
oiN — d<N — 2 vertices (otherwise it is isomorphic to J-'). The additional 
d > 2 vertices form connections to Q and among each other. Also in this 
case, we call these d vertices peripheral, and the N — d vertices of Q central. 
Define 7 = (2) ~ ^^j which corresponds to the number of non-neighboring 
vertices in a graph. By assumption, 1 < 7 < A^ — 2. It is easy to see that 
(i < 7, since we have only 7 edges missing, and for each peripheral vertex u, 
at least m„ > 1 edges between itself and Q have to be missing, otherwise Q 
is not the largest clique. We will add all of the clustering coefficients for all 
the vertices, and then show that the sum is always greater for J-". 
We note the following: 

• The number of vertices with clustering coefficient equal to 1, is 1 + 7 
for J-" and at most d+ [^J for Q. The number of such vertices is smaller 
or equal in Q than in J-", for all 2 < (i < 7. Conversely, J-" has exactly 
A^ — 1 — 7 vertices with clustering coefficient less than 1, and Q, has at 
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least N — d — [^J such vertices. 

• All vertices that have a clustering coefficient smaller than 1 in J-" have 
degree A^ — 1, the largest possible degree in a graph with A^ vertices. 

We will find the sum of the clustering coefficients for both graphs in terms of 
the number of triangles missing. Then, we will show that Cs{J^) > Cs{G), 
for any Q i^ T . 

Since 7 = A^ — 1 — a, the sum of the clustering coefficients for J-", is 



7( A^-l- 7) 



Cs{T) =N- '' ^^_,^ ^^ (15) 



The sum of the clustering coefficients for Q is 

N 



Cs{g)=N-J2j^ (16) 

M=l \2 ) 



iV) 



<N-HW^ (17) 



-^""(% (18) 

where Wu is the number of edges missing among the neighbors of vertex u 
and Sg = J2u=i Wu- 

In order to prove the lemma, we only need to prove that the missing 
triangles {Sg) of Q are more than those [sj) missing in J-", in other words that 

N 

/^ = Sg-Sf = Y,Wu-l{N-l-^) (19) 

u=l 

is greater than or equal to zero. 

We will now find the minimum number of central vertices that are con- 
nected to at least two peripheral vertices. This happens when d—2 peripheral 
vertices have the largest possible degree A^ — ci — 1, and the remaining two 
have the smallest possible degrees, and consequently the minimum number of 
clique vertices that they are both connected to. If we assume that h edges are 
not present among peripheral vertices, we have a total of t = d{N — d) —7 + 6 
edges between peripheral vertices and central vertices. If we further assume 

13 



that all but 2 peripheral vertices have the largest possible number of edges 
with the clique (which is N — d — 1), the remaining peripheral vertices will 
have a degree sum oi r = t — {d — 2) {N — rf — 1) = 2N — rf — 2 — 7 + 6. Since 
we only have N — d vertices in Q, at least r — (A^ — d) = N — 2 — '-f + boi 
them will be connected to two peripheral vertices. 

The total number of triangles missing from the central vertices, because 
of edges missing between peripheral and central vertices, is 

d 

Mc-P = Y,^u{N-d-m^) (20) 

u=l 

(d \ d 

J2mA-Y,ml (21) 

M = l / M=l 

The number of triangles missing because of the absence of edges among 
peripheral vertices, as shown above, is at least 

b 

Mp_p> ^(A^- 2-7 + 6) = 6(A^ -2 -7 + 6). (22) 

e=l 

We are now ready to count the number of triangles missing from Q. We 
omit the triangles missing from the peripheral vertices, and assume that 
all peripheral vertices have clustering equal to 1 (otherwise the number of 
missing trinangles in Q would further decrease). The total number of triangles 
missing for Q is 

Sg > Mc-P + Mp_p (23) 

/ d \ d 

>(Ar-rf) VmJ-Vm2+6(Ar-2-7 + 6) (24) 




>{N -d){-i-h)-^ml + h{N -2--i + h) (25) 



u=l 



with 'Yliu=i "^M = l~b, since 7 edges in total are missing. In order to minimize 
Sg, we need to maximize the sum of squares X]m=i "^L under the constraints 
rriu > 1 for u = 1,2, .. .d and X]m=i ^^m = '~f — b. The maximum is achieved 
when rriu = I ior u = 1,2, . . .d — 1, and rrid = 7 — 6 — (c? — 1). As a result. 



Sg 



> {N - d){-f - b) - {{d - 1) + {-f - b - d + if) + 6(iV - 2 - 7 + 6) (26) 

14 



Sg > 7^ - 7^ + Tc? - &c? + c/ + 7& - 27 - d'^. (27) 

The difference between Sg and sj is 



= ^9-^f 


(28) 


> ^iV - 7^ + 7^ - 6rf + rf + 76 - 27 - rf^ - 7A^ + 7^ + 7 


(29) 


> 'jd - bd + d + 'jb - 'J - (f 


(30) 


> (7-rf)(6 + rf-l). 


(31) 



By assumption, d >2 and 6 > 0. The above product is always positive, when 
(i < 7. If c? = 7, we have many fully connected peripheral vertices, each of 
which connect to all but one vertex of Q. No two peripheral vertices are 
connected to the same central ones, for in this case Q would not be maximal. 
As a result, there are less than 7 vertices in Q that have unit clustering, and 
more than A^ — 1 — 7 vertices in Q that are missing one or more triangles, 
and in that case too, Sg > Sf. So in every case, Cs{J^) > Cs{G)- □ 

There are two cases of type II almost complete graphs, depending on 
whether or not there is an edge between the two peripheral vertices. If there 
is no such edge (type Ila), the graph with the largest clustering takes the 
form of Figure 4(b). When the two peripheral vertices are connected (type 
lib), the graph with the form shown in Figure 4(c) has the largest average 
clustering coefficient, as shown in the next lemma. 

Lemma 7. Assume a type II almost com,plete graph Q{N, m) with two pe- 
ripheral vertices u andv. Ifu andv are not connected, the average clustering 
coefficient is maximized when u and v have the smallest number of common 
neighbors. If v and v are not connected, the graph with the largest clus- 
tering coefficient is the one where they have the largest number of common 
neighbors. 

Proof. Let c = m — (^ ~)be the total number of edges connecting the two 
peripheral vertices to the rest of the graph. Without loss of generality, assume 
that u has smaller degree than v, d{u) < d{v). Then according to Lemma 
A, c = du + dy = 2a + b, where a is the number of their common neighbors, 
and b = dy — du the neigbors of v that are not connected to u. The sum of 
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clustering coefficients will be 

l + {N-2-a) + {N-2-a-h) N-2-a-b 
C,{g) = N -a p— b -^ 

(32) 
_ c-b 2N-3~c_ N-2-^^-b 

Since c is a constant, the sum of clustering coefficients is a function of b, and 
by differentiating, we find that it is increasing with b for all 2 < c < A^ — 2 
and < 6 < c - 2. 

Similarly, when there is an edge between u and v, c = 2a + 6 + 1, and the 
sum of vertex clustering coefficients is 

(N-2-a) + iN-2-a-b) N-2-a-b b 

(34) 
-AT- ^-^-^ 2iV - 3 - c _ iV-2-^ b 

— ^ 9 ■ fN-l\ "' (N-2\ + , b+c+i . ■ (•35) 

The last equation is a decreasing function of b for all 2 < c < A^ — 2 and 
< 6 < c — 3. From the above, we find that if a type II almost complete 
graph has maximum average clustering coefficient, there will be a peripheral 
vertex with degree 1 (type Ila), or both vertices will have the maximum 
number of common neighbors (type lib). D 

For a given size m of a type II almost complete graph, we need to decide 
which of the two variations has the largest clustering. We can easily see that 
the first two terms in both equations differ in the fact that c and c — 1 edges 
connect a peripheral with a central vertex respectively. In the lib type, if 
6 > 1, the third term in equation (35) becomes very large, compared to the 
other terms for any A^ > 5, and a necessary condition in order to have a type 
lib almost complete graph is that 6 = (the two peripheral vertices have 
exactly the same neighbors), which means that c has to be odd, c = 2a + 1. 
Then, a simple comparison of the two equations shows that if 

^ -1 + 3Ar + J2 (12 - iV - Am + N^) 

' ^ TT^v ('') 
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then the type Ila has larger average clustering than type lib. The above 
number scales proportionally to the square root of the order N. 

Lemma 8. A type II almost complete graph has larger clustering than any 
other graph of the same order and size. 

Proof. We will use induction. The claim is true for graphs of order A^ = 4, 
shown by exhaustive enumeration of all the graphs with 4 vertices. Now, we 
will assume that it is true for all graphs of order up to A^ — 1 , and will show 
that this is still true for a graph of order A^. The optimal graph will be found 
by using equation (10). The connected graph of order A^ — 1 with the largest 
clustering coefficient is either a type I or type II almost complete graph for 
all possible degrees d of the additional vertex w, because 1 < d < A^ — 1, and 
by assumption (^2"^) + 2 < m < (^2^^), so 

N-3\ ^ fN-l\ 

If it is a type I almost complete graph, addition of one vertex will transform 
it into an almost complete graph of type II, regardless of its degree. Conse- 
quently, the lemma holds in this case. Now assume that the existing graph 
is a type II almost complete graph. Assume that the two already existing 
peripheral vertices u and v have a and /3 edges with vertices of the largest 
clique, li a + (3 + d = N — 1, then it is easy to see that the optimal graph 
consists of a full graph of order N — 2, and two vertices of degree 1 that 
connect to it. For every other value oi a + P + d, since the initial graph 
is type II almost complete, d > 2, and w has at least one neighbor in the 
clique. Considering u, v and w in pairs, we can show that a graph with 
three peripheral vertices cannot be optimal, in other words, not all three 
can have less than A^ — 3 edges to the clique of order A^ — 3. The reason is 
that according to equation 9, any choice of a single vertex from the graph 
should yield the same result, in terms of maximizing the sum of clustering 
coefficients. By the induction hypothesis, in each pair, one of the peripheral 
vertices should have one connection to the clique, (not possible under the 
constraint a + /3 + d > N — 1) or all of them should be connected to each 
other and to the same vertices of the clique (also impossible for the same 
reason). In all cases, a type II almost complete graph will have the largest 
average clustering coefficient. D 
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7 Graphs with the Largest Clustering 

In this section, we will combine the previous results to show the form of the 
graphs with the largest clustering for a graph of arbitrary order and size. 

Lemma 9. The largest clustering graph with N vertices and < m < N — 2 
edges consists of complete components of 2 or 3 vertices each. 

Proof. For m=0, we have no edges and the clustering coefficient is equal to 
zero. If r/i > 0, we first connect pairs of vertices, until all of them have 
degree 1. If we have any edges left, we start forming triangles, by trying to 
keep the vertices that do not have any edges at a minimum. In a triangle, 
we have the same number of vertices and edges, and since m < N — 2, the 
number of edges is not enough to connect all the vertices in triangles. The 
above procedure will guarantee that the disconnected graph will have a sum 
of clustering coefficients equal to 1. D 

Lemma 10. The tree that has the largest clustering (according to the con- 
vention that a vertex with degree 1 has clustering 1) is the star graph. 

Proof. Since there are no cycles in a tree (which also means no triangles), 
a vertex with degree larger than 1 will have a clustering coefficient equal to 
zero. By minimizing the number of such vertices (one vertex is the minimum 
number since the graph needs to be connected), we achieve the largest clus- 
tering for the star graph. Also, note that the star graph is a graph with one 
cut vertex, connected to several complete graphs of order 2. D 

Theorem 2. The graph with N vertices and N < m < (^ 2')"'"-'- ^^9^^ ^^^^ 
has the largest possible clustering coefficient consists of one cut vertex that is 
shared by complete or almost complete subgraphs. 

Proof. We will use induction on the number of vertices to find the optimal 
graph for A^ < m < (\^) + 1. 

For N=4, the statement is found to be true, by evaluating all the possible 
graphs. (For computational considerations, see [4, 5]. Assume that the op- 
timal graph for every number of vertices up to A^ — 1 and for the respective 
range of sizes, has the form mentioned above. 

The graph with A^ vertices and m edges will be found by connecting a 
new vertex of degree d to an optimal graph of order A^ — 1 and size m — d. 
The resulting graph will have the maximum possible clustering of the new 



vertex (equal to 1), the inaximum clustering of the rest of the graph it is 
attached to, and the minimal possible decrease in clustering for the vertices 
it is connected to. 

If d is larger than the order of the largest subgraph in ^(A^ — 1, m — (i), 
then it will be connected to at least two smaller complete or almost complete 
subgraphs. Now consider the subgraph that consists of these subgraphs plus 
the added vertex of degree d. It has P < N vertices and i? < ( ^^) + 2 edges. 
In addition, it has a cut vertex, which connects its two or more components. 
This is a subgraph whose form is not optimal, according to the induction 
hypothesis, meaning that there is a smaller degree d for the vertex m, for 
which equation (10) gave a larger clustering coefficient for order P and size 
R. So we only need to consider vertices with degree d less than the size of 
the largest "module" if ^(A^ — l,in — d) is already connected. Moreover, we 
only need to try to connect it to one clique. The rest of the cliques will not 
change, so we focus on the clique where the new vertex with the new edges 
is added, and prove that it will still have the same form. If it is a complete 
graph, then after adding the new vertex m, it will have one peripheral vertex, 
which makes the claim hold (the complete graph will now be a type II almost 
complete graph). If it is a type I almost complete subgraph, after the addition 
of the new vertex, it will become a type II almost complete subgraph, as 
shown above. If the subgraph is a type II almost complete subgraph, then 
connecting u to it cannot yield an optimal subgraph, since by increasing the 
degree of u to the size of this subgraph, we can make a graph with larger 
clustering, as shown in the lemma about the optimal form of type II almost 
complete graphs. In any case, the new graph will have the form described in 
the claim. D 

The arguments above just prove the form that the optimal graph needs 
to have, and not the exact arrangement of edges among its vertices. Using 
equation (10), we can find the optimal graph in polynomial time, and as 
shown above, we only need to consider a small range of different degrees for 
the added vertex. 

Corollary 2. According to the above analysis, the graph Q{N, m) with the 
largest possible clustering coefficient has one of the following forms, depending 
on its size m: 

• < m < N — 2: Disconnected graph, consisting of complete compo- 
nents with 2 or 3 vertices each. 
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• A^ — l<?Ti<( 2 ) + -'-■ Complete or almost complete subgraphs that 
share one vertex. 

• ( ~^) + 2 < m < ( ~^) : Type II almost complete graph. 

• ( 2^^) '^ ^ — ''^ — {2) ~ ^'- '^yP^ -'■ almost complete graph. 

• m= (2)- Complete graph, with average clustering (7=1. 

8 Properties of the Graphs with the Largest 
Average Clustering Coefficient 

1. The form of the optimal graphs is recursive. Every combination of 
clusters (modules) that are connected through a cut vertex themselves 
form an optimal graph of the respective order and size. This suggests an 
alternative way to generate an optimal graph, where it can be found as 
a combination of smaller complete or almost-complete subgraphs. We 
can find the optimal form recursively with this procedure, too. Since 
there is always a cut vertex, we can break the network in two smaller 
parts, that do not affect each other's average clustering coefficient, and 
optimize them independently. 

2. When a new vertex is attached to a graph with the process described 
above, it is always attached to vertices that are members of the same 
clique. This property can make computation easier, since we only need 
to consider cliques of size equal or greater to the degree of the candidate 
additional vertex, according to equation (10). 

3. No graph with the largest average clustering has any chordless cycles. 

4. No graph with the largest average clustering has any induced bipartite 
subgraphs. 

5. Every cut set is a complete graph of size 1 or more. In addition, if 
there exists a cut vertex, it also belongs to all larger cut sets. 

6. For all connected optimal graphs, there is at least one vertex that is 
connected to every other vertex in the graph. If the graph is not almost 
complete, then the central vertex is unique. Otherwise, we may have 
more than one such vertex. 
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7. The networks with the largest clustering coefficient also have the small- 
est possible average distance among the various vertices. For every 
pair of vertices that are not connected, there is a path of length 2 that 
connects them. These networks are classified as "small world", be- 
cause they have both high average clustering and small average shortest 
paths, as mentioned previously. 

8. From a computational standpoint, in order to find the optimal graph 
^(A^, m), we first need to find the optimal graphs of order A^ — 1, for all 
sizes, which in turn means that we need to compute all such graphs of 
all orders up to N — 1. The complexity of this procedure is polynomial, 
0{N^), since we need to find the optimal graphs for all graphs of order 
up to A^, each of which has O^N"^) edges, and each time we need to 
add one vertex, trying various degrees d = 0{N), to all appropriate 
0{N) cliques. In order to avoid computing recursively the same optimal 
graphs, we need to start from the smallest possible optimal graph (for 
A^ = 3), and build our way to the desired A^, and storing all the optimal 
graphs to memory. If we had to perform an exhaustive search, the 
complexity would be prohibitively large, C(2^ ). 

9 Fast Generation of Graphs with Small Dis- 
tance and Large Average Clustering 

From the previous sections, it is now known that the form of the graphs with 
the smallest distance and largest clustering take the form of complete or 
almost complete subgraphs that are connected through one vertex. In order 
to generate very large networks that have this form, we can resort to a much 
simpler algorithm that has almost constant complexity, and can generate 
networks with arbitrarily many vertices, with the minimal average distance 
and very close to optimal clustering coefficient. Given the order and size of 
the graph, we find the largest complete subgraph, which leaves enough edges 
for the rest of the network to be connected. Then we subtract the number 
of vertices and edges used, and repeat the process until all the vertices and 
edges have been used. If at any point during the process, it is found that we 
cannot form a subgraph with the number of vertices and edges, we backtrack 
and reduce the cluster in the previous step (see ["]). 
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10 Clustering Coefficient Sensitivity 

We have numerically computed the sensitivity of the network clustering under 
edge rewiring. In Figure 5, we show the evolution of the average clustering 
coefficient after rewiring a percentage of the edges, for a typical large network. 
If we rewire a small number of edges, this has no notable effect on the average 
clustering coefficient, since most of the vertices have only a small amount of 
neighbors that are not in their previous clique. This is a similar situation as 
in [1], with the only difference that the distance is very small from the start, 
and quite robust to rewirings, increasing less than 0.1% even after we rewire 
a number of edges equal to the size of the network. 
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Figure 5: Typical change of the clustering coefficient of a network with N=500 
vertices and various edge densities D . At every step one edge is rewired and 
the new average clustering coefficient is measured. We repeat the process for a 
number of steps equal to the number of edges in the graph. The rate of decrease in 
clustering is very small at ffist, but increases as we introduce more randomness in 
network architecture. Larger networks are more robust to rewiring a fixed number 
of edges. 
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(a) 



(b) 



(c) 



Figure 6: (a) The graph with the largest clustering coefficient of order 11 and size 
22 consists of three complete subgraphs. Removal of the central vertex will render 
the network disconnected, (b) Addition of a few new edges among the different 
subgraphs creates new alternative communication paths among the various vertices 
of the network. As a result, the minimum cut set of the network is increased. (c) 
Adding new edges that connect one edge with vertices in the other subgraphs 
essentially creates a new network with more than one central vertex. 

11 Resilience to Vertex or Edge Removal 

The small- world networks studied here are very robust to edge removal. Since 
almost every vertex is part of a complete graph, we need to remove at least 
as many edges as the order of the smallest clique it belongs to in order to 
render the network disconnected. Even in that case, the number of vertices 
disconnected is at most equal to the number of edges deleted. 

The situation is different for vertex removal. We immediately note that 
unless we have enough edges to build an almost complete graph, the network 
becomes disconnected if we remove the central vertex, and the number of 
disconnected components is the number of modules in the initial network. 
Real world networks on the other hand, rarely have an articulation point. The 
robustness of a network to vertex failure is determined by its smallest cut set, 
and in this type of network, it consists of a single vertex (the central vertex), 
removal of which will render it disconnected. Depending on the application, 
we may be able to add new edges, which will increase the network's robustness 
to vertex failure. There are many ways to add the new edges, all of which 
result in reduced clustering. One of them is to distribute the new edges among 
vertices of the various modules, as shown in figure 6(b). This method ensures 
that if the central vertex is removed, there are still communication channels 
among the different subsystems of the network. Another way is shown in 
figure 6(c), where one or more of the vertices forms new connections with 
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(a) 



(b) 




Figure 7: Graphs with the largest clustering coefficient, defining the clustering 
coefficient of vertices with degree d = 1 as equal to zero, (a) The optimal graph 
for G{8, 13) has two cut vertices, connected through a bridge, (b) The smallest 
type II almost complete subgraph consists of a clique with N — 2 vertices, and both 
peripheral vertices have degree at 2. The peripheral vertices are shown in red. (c) 
The smallest type I almost complete graph under the new convention, where the 
peripheral vertex has degree 2. 



the vertices of all the other subgraphs, which in essence increases the number 
of 'central' vertices. Rewiring or adding new edges will have the least effect 
when they connect vertices with a large degree. 



12 Alternative Definition for Vertices with 
Degree 1 

A common alternative convention for the vertices with degree 1 is to define 
their clustering coefficient as equal to 0. The process of finding the graphs 
with the largest clustering under this new convention is very similar to the 
previous case. The biggest difference is the form of graphs with a small 
number of edges, when they consist of at least one subgraph that is a tree. 
For m < N — 1, the graph is disconnected, and the optimal form consists of a 
group of disconnected triangles. When m = N — 1, the only form a connected 
graph can take is a tree, in which case, any arrangement of the vertices will 
yield a clustering coefficient of zero, since there can be no triangles. As m 
increases, we are able to form the first triangles, and the vertices that have 
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nonzero clustering are part of a triangle, and have one more edge to the 
rest of the graph, keeping their degree low. As the number of edges increases 
further, and every edge is part of at least one triangle, the form of the optimal 
graphs resembles the form under the previous case, with only one difference, 
and after which point, no vertex has degree 1. Some graphs, instead of having 
a unique cut vertex, have two cut vertices that are connected instead through 
one bridge edge, as shown in Figure 7(a). (Under the initial convention, the 
same graph would consist of the same two modules, plus one vertex with 
degree 1, connected to the single cut vertex). The last difference is that 
the definition of the almost complete graphs has to be changed so that the 
peripheral vertices have a degree of at least 2 (Figures 7(b) and 7(c). The 
process of finding the optimal graphs remains otherwise the same (see [5]), 
and an example for a graph of order 10 is shown in Figure 9. Comparing 
figures 8 and 9, we immediately see the similarity of the optimal graphs in 
both cases. 
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Figure 8: Graphs of order 10 and size 9 < m < 45 with the largest clustering 
coefficient, assuming that vertices with degree 1 have clustering coefficient equal 
to 1. 
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Figure 9: Graphs of order 10 and size 10 < m < 45 with the largest average clus- 
tering coefficient, assuming that vertices with degree 1 have a clustering coefficient 
equal to zero. 
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